Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 21714 |
| Missing cells | 35814 |
| Missing cells (%) | 7.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.5 MiB |
| Average record size in memory | 168.0 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 10 |
property_type has constant value "21714" | Constant |
country has constant value "21714" | Constant |
city has constant value "21714" | Constant |
current_zones has a high cardinality: 595 distinct values | High cardinality |
zone has a high cardinality: 419 distinct values | High cardinality |
century_zone has a high cardinality: 141 distinct values | High cardinality |
property_status has 1342 (6.2%) missing values | Missing |
current_zones has 1057 (4.9%) missing values | Missing |
zone has 1057 (4.9%) missing values | Missing |
century_zone has 2052 (9.5%) missing values | Missing |
other_rooms has 2450 (11.3%) missing values | Missing |
year_of_construction has 3594 (16.6%) missing values | Missing |
year_of_renovation has 3595 (16.6%) missing values | Missing |
closed_price has 20656 (95.1%) missing values | Missing |
price is highly skewed (γ1 = 64.5158093) | Skewed |
interior_area is highly skewed (γ1 = 30.34516288) | Skewed |
gros_area is highly skewed (γ1 = 139.8331842) | Skewed |
year_of_construction is highly skewed (γ1 = 105.0771713) | Skewed |
year_of_renovation is highly skewed (γ1 = 21.21459507) | Skewed |
df_index has unique values | Unique |
price has 697 (3.2%) zeros | Zeros |
interior_area has 8375 (38.6%) zeros | Zeros |
gros_area has 2903 (13.4%) zeros | Zeros |
bedrooms has 1713 (7.9%) zeros | Zeros |
bathrooms has 1994 (9.2%) zeros | Zeros |
other_rooms has 17999 (82.9%) zeros | Zeros |
year_of_construction has 16114 (74.2%) zeros | Zeros |
year_of_renovation has 18079 (83.3%) zeros | Zeros |
Reproduction
| Analysis started | 2021-05-25 17:39:00.123820 |
|---|---|
| Analysis finished | 2021-05-25 17:39:22.113093 |
| Duration | 21.99 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 21714 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10857.26292 |
|---|---|
| Minimum | 0 |
| Maximum | 21736 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1085.65 |
| Q1 | 5428.25 |
| median | 10856.5 |
| Q3 | 16284.75 |
| 95-th percentile | 20630.35 |
| Maximum | 21736 |
| Range | 21736 |
| Interquartile range (IQR) | 10856.5 |
Descriptive statistics
| Standard deviation | 6269.707704 |
|---|---|
| Coefficient of variation (CV) | 0.5774666923 |
| Kurtosis | -1.199202011 |
| Mean | 10857.26292 |
| Median Absolute Deviation (MAD) | 5428.5 |
| Skewness | 0.0006495971522 |
| Sum | 235754607 |
| Variance | 39309234.7 |
| Monotocity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 21151 | 1 | < 0.1% | |
| 14994 | 1 | < 0.1% | |
| 12947 | 1 | < 0.1% | |
| 2708 | 1 | < 0.1% | |
| 661 | 1 | < 0.1% | |
| 6806 | 1 | < 0.1% | |
| 4759 | 1 | < 0.1% | |
| 19100 | 1 | < 0.1% | |
| 17053 | 1 | < 0.1% | |
| Other values (21704) | 21704 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 21736 | 1 | < 0.1% | |
| 21735 | 1 | < 0.1% | |
| 21734 | 1 | < 0.1% | |
| 21733 | 1 | < 0.1% | |
| 21732 | 1 | < 0.1% |
propertiesid
Real number (ℝ≥0)
| Distinct | 20844 |
|---|---|
| Distinct (%) | 96.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 209069.6781 |
|---|---|
| Minimum | 2645 |
| Maximum | 948301 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 2645 |
|---|---|
| 5-th percentile | 4401.65 |
| Q1 | 13089.25 |
| median | 87979 |
| Q3 | 386907.75 |
| 95-th percentile | 847437.25 |
| Maximum | 948301 |
| Range | 945656 |
| Interquartile range (IQR) | 373818.5 |
Descriptive statistics
| Standard deviation | 280007.4863 |
|---|---|
| Coefficient of variation (CV) | 1.339302231 |
| Kurtosis | 0.2908881723 |
| Mean | 209069.6781 |
| Median Absolute Deviation (MAD) | 77778.5 |
| Skewness | 1.302112205 |
| Sum | 4539738991 |
| Variance | 7.840419236e+10 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 8970 | 3 | < 0.1% | |
| 9616 | 3 | < 0.1% | |
| 13616 | 3 | < 0.1% | |
| 9808 | 3 | < 0.1% | |
| 9602 | 3 | < 0.1% | |
| 10922 | 3 | < 0.1% | |
| 10932 | 3 | < 0.1% | |
| 9352 | 3 | < 0.1% | |
| 9080 | 3 | < 0.1% | |
| 8972 | 3 | < 0.1% | |
| Other values (20834) | 21684 | 99.9% |
| Value | Count | Frequency (%) | |
| 2645 | 1 | < 0.1% | |
| 2648 | 1 | < 0.1% | |
| 2649 | 1 | < 0.1% | |
| 2650 | 1 | < 0.1% | |
| 2651 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 948301 | 1 | < 0.1% | |
| 948275 | 1 | < 0.1% | |
| 948219 | 1 | < 0.1% | |
| 947861 | 1 | < 0.1% | |
| 947596 | 1 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 169.6 KiB |
| Apartment |
|---|
| Value | Count | Frequency (%) | |
| Apartment | 21714 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1342 |
| Missing (%) | 6.2% |
| Memory size | 169.6 KiB |
| Used | |
|---|---|
| New | |
| Under Construction | 748 |
| In project | 11 |
| Remodelled | 6 |
| Other values (4) | 15 |
| Value | Count | Frequency (%) | |
| Used | 12210 | 56.2% | |
| New | 7382 | 34.0% | |
| Under Construction | 748 | 3.4% | |
| In project | 11 | 0.1% | |
| Remodelled | 6 | < 0.1% | |
| Not Applicable | 5 | < 0.1% | |
| Refurbished | 5 | < 0.1% | |
| For refurbishment | 4 | < 0.1% | |
| To demolish or rebuild | 1 | < 0.1% | |
| (Missing) | 1342 | 6.2% |
Frequencies of value counts
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 22 |
|---|---|
| Median length | 4 |
| Mean length | 4.092336741 |
| Min length | 3 |
availability
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 169.6 KiB |
| Withdrawn | |
|---|---|
| Sold | |
| Available | |
| In evaluation | 300 |
| Reserved | 84 |
| Other values (3) | 80 |
| Value | Count | Frequency (%) | |
| Withdrawn | 13139 | 60.5% | |
| Sold | 4181 | 19.3% | |
| Available | 3929 | 18.1% | |
| In evaluation | 300 | 1.4% | |
| Reserved | 84 | 0.4% | |
| Rented | 63 | 0.3% | |
| In negotiation | 16 | 0.1% | |
| Potential | 1 | < 0.1% | |
| (Missing) | 1 | < 0.1% |
Frequencies of value counts
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 9 |
| Mean length | 8.08335636 |
| Min length | 3 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 169.6 KiB |
| Albania |
|---|
| Value | Count | Frequency (%) | |
| Albania | 21714 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
division
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 169.6 KiB |
| Tirana | |
|---|---|
| Durres | 5 |
| Budva | 1 |
| Berat | 1 |
| Elbasan | 1 |
| Value | Count | Frequency (%) | |
| Tirana | 21699 | 99.9% | |
| Durres | 5 | < 0.1% | |
| Budva | 1 | < 0.1% | |
| Berat | 1 | < 0.1% | |
| Elbasan | 1 | < 0.1% | |
| (Missing) | 7 | < 0.1% |
Frequencies of value counts
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.998986829 |
| Min length | 3 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 169.6 KiB |
| Tirana |
|---|
| Value | Count | Frequency (%) | |
| Tirana | 21714 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
| Distinct | 595 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 1057 |
| Missing (%) | 4.9% |
| Memory size | 169.6 KiB |
| Fresku | 1509 |
|---|---|
| Unaza e re | 1116 |
| Komuna e Parisit | 723 |
| Rruga e Kavajes | 502 |
| Kodra e Diellit | 502 |
| Other values (590) |
| Value | Count | Frequency (%) | |
| Fresku | 1509 | 6.9% | |
| Unaza e re | 1116 | 5.1% | |
| Komuna e Parisit | 723 | 3.3% | |
| Rruga e Kavajes | 502 | 2.3% | |
| Kodra e Diellit | 502 | 2.3% | |
| Astiri | 499 | 2.3% | |
| 21 Dhjetori | 468 | 2.2% | |
| Don Bosko | 413 | 1.9% | |
| Ali Demi | 411 | 1.9% | |
| Liqeni i Thatë | 377 | 1.7% | |
| Other values (585) | 14137 | 65.1% | |
| (Missing) | 1057 | 4.9% |
Frequencies of value counts
Unique
| Unique | 251 ? |
|---|---|
| Unique (%) | 1.2% |
Histogram of lengths of the category
Length
| Max length | 96 |
|---|---|
| Median length | 11 |
| Mean length | 12.3328728 |
| Min length | 3 |
| Distinct | 419 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 1057 |
| Missing (%) | 4.9% |
| Memory size | 169.6 KiB |
| Fresku | 1510 |
|---|---|
| Unaza e re | 1149 |
| Komuna e Parisit | 729 |
| Rruga e Kavajes | 510 |
| Astiri | 509 |
| Other values (414) |
| Value | Count | Frequency (%) | |
| Fresku | 1510 | 7.0% | |
| Unaza e re | 1149 | 5.3% | |
| Komuna e Parisit | 729 | 3.4% | |
| Rruga e Kavajes | 510 | 2.3% | |
| Astiri | 509 | 2.3% | |
| 21 Dhjetori | 468 | 2.2% | |
| Don Bosko | 420 | 1.9% | |
| Ali Demi | 417 | 1.9% | |
| Liqeni i Thatë | 378 | 1.7% | |
| Kodra e Diellit | 376 | 1.7% | |
| Other values (409) | 14191 | 65.4% | |
| (Missing) | 1057 | 4.9% |
Frequencies of value counts
Unique
| Unique | 108 ? |
|---|---|
| Unique (%) | 0.5% |
Histogram of lengths of the category
Length
| Max length | 45 |
|---|---|
| Median length | 11 |
| Mean length | 12.01188174 |
| Min length | 2 |
| Distinct | 141 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 2052 |
| Missing (%) | 9.5% |
| Memory size | 169.6 KiB |
| Fresku | |
|---|---|
| Unaza e Re | 1234 |
| Don Bosco | 864 |
| 21 Dhjetori | 820 |
| Ali Demi | 754 |
| Other values (136) |
| Value | Count | Frequency (%) | |
| Fresku | 1555 | 7.2% | |
| Unaza e Re | 1234 | 5.7% | |
| Don Bosco | 864 | 4.0% | |
| 21 Dhjetori | 820 | 3.8% | |
| Ali Demi | 754 | 3.5% | |
| Komuna e Parisit | 729 | 3.4% | |
| Liqeni i Thatë | 705 | 3.2% | |
| Astiri | 544 | 2.5% | |
| Rruga e Kavajes | 511 | 2.4% | |
| Yzberish | 509 | 2.3% | |
| Other values (131) | 11437 | 52.7% | |
| (Missing) | 2052 | 9.5% |
Frequencies of value counts
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 37 |
|---|---|
| Median length | 10 |
| Mean length | 11.36423506 |
| Min length | 3 |
| Distinct | 2493 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95521.109 |
|---|---|
| Minimum | 0 |
| Maximum | 14087000 |
| Zeros | 697 |
| Zeros (%) | 3.2% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 118.65 |
| Q1 | 57000 |
| median | 79900 |
| Q3 | 113000 |
| 95-th percentile | 210740 |
| Maximum | 14087000 |
| Range | 14087000 |
| Interquartile range (IQR) | 56000 |
Descriptive statistics
| Standard deviation | 127400.214 |
|---|---|
| Coefficient of variation (CV) | 1.333738849 |
| Kurtosis | 6768.538573 |
| Mean | 95521.109 |
| Median Absolute Deviation (MAD) | 25100 |
| Skewness | 64.5158093 |
| Sum | 2074145361 |
| Variance | 1.623081452e+10 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 697 | 3.2% | |
| 65000 | 480 | 2.2% | |
| 75000 | 452 | 2.1% | |
| 85000 | 411 | 1.9% | |
| 55000 | 388 | 1.8% | |
| 70000 | 380 | 1.8% | |
| 60000 | 378 | 1.7% | |
| 80000 | 377 | 1.7% | |
| 90000 | 321 | 1.5% | |
| 100000 | 303 | 1.4% | |
| Other values (2483) | 17527 | 80.7% |
| Value | Count | Frequency (%) | |
| 0 | 697 | 3.2% | |
| 1 | 5 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 10 | 2 | < 0.1% | |
| 19 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 14087000 | 1 | < 0.1% | |
| 3823648 | 1 | < 0.1% | |
| 3200000 | 1 | < 0.1% | |
| 2500000 | 1 | < 0.1% | |
| 2400000 | 1 | < 0.1% |
| Distinct | 290 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.61027956 |
|---|---|
| Minimum | 0 |
| Maximum | 5700 |
| Zeros | 8375 |
| Zeros (%) | 38.6% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 60 |
| Q3 | 94 |
| 95-th percentile | 135 |
| Maximum | 5700 |
| Range | 5700 |
| Interquartile range (IQR) | 94 |
Descriptive statistics
| Standard deviation | 87.29917109 |
|---|---|
| Coefficient of variation (CV) | 1.598584951 |
| Kurtosis | 1621.855475 |
| Mean | 54.61027956 |
| Median Absolute Deviation (MAD) | 58 |
| Skewness | 30.34516288 |
| Sum | 1185753 |
| Variance | 7621.145274 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 8375 | 38.6% | |
| 2 | 668 | 3.1% | |
| 100 | 282 | 1.3% | |
| 1 | 266 | 1.2% | |
| 90 | 260 | 1.2% | |
| 95 | 222 | 1.0% | |
| 80 | 211 | 1.0% | |
| 94 | 209 | 1.0% | |
| 75 | 207 | 1.0% | |
| 110 | 207 | 1.0% | |
| Other values (280) | 10806 | 49.8% |
| Value | Count | Frequency (%) | |
| 0 | 8375 | 38.6% | |
| 1 | 266 | 1.2% | |
| 2 | 668 | 3.1% | |
| 3 | 173 | 0.8% | |
| 4 | 14 | 0.1% |
| Value | Count | Frequency (%) | |
| 5700 | 1 | < 0.1% | |
| 5000 | 1 | < 0.1% | |
| 3746 | 1 | < 0.1% | |
| 3600 | 1 | < 0.1% | |
| 2777 | 1 | < 0.1% |
| Distinct | 341 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.87100488 |
|---|---|
| Minimum | -2 |
| Maximum | 166600 |
| Zeros | 2903 |
| Zeros (%) | 13.4% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 61 |
| median | 90 |
| Q3 | 112 |
| 95-th percentile | 156 |
| Maximum | 166600 |
| Range | 166602 |
| Interquartile range (IQR) | 51 |
Descriptive statistics
| Standard deviation | 1151.229364 |
|---|---|
| Coefficient of variation (CV) | 12.13468083 |
| Kurtosis | 20163.39047 |
| Mean | 94.87100488 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 139.8331842 |
| Sum | 2060029 |
| Variance | 1325329.048 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 2903 | 13.4% | |
| 1 | 611 | 2.8% | |
| 2 | 492 | 2.3% | |
| 100 | 470 | 2.2% | |
| 110 | 391 | 1.8% | |
| 105 | 385 | 1.8% | |
| 75 | 325 | 1.5% | |
| 70 | 323 | 1.5% | |
| 120 | 314 | 1.4% | |
| 90 | 297 | 1.4% | |
| Other values (331) | 15203 | 70.0% |
| Value | Count | Frequency (%) | |
| -2 | 4 | < 0.1% | |
| 0 | 2903 | 13.4% | |
| 1 | 611 | 2.8% | |
| 2 | 492 | 2.3% | |
| 3 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| 166600 | 1 | < 0.1% | |
| 20000 | 1 | < 0.1% | |
| 19000 | 1 | < 0.1% | |
| 7600 | 1 | < 0.1% | |
| 6450 | 1 | < 0.1% |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.742470296 |
|---|---|
| Minimum | 0 |
| Maximum | 21 |
| Zeros | 1713 |
| Zeros (%) | 7.9% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8658430802 |
|---|---|
| Coefficient of variation (CV) | 0.4969055039 |
| Kurtosis | 16.99153747 |
| Mean | 1.742470296 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9846865158 |
| Sum | 37836 |
| Variance | 0.7496842395 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=13)
| Value | Count | Frequency (%) | |
| 2 | 11413 | 52.6% | |
| 1 | 5583 | 25.7% | |
| 3 | 2750 | 12.7% | |
| 0 | 1713 | 7.9% | |
| 4 | 200 | 0.9% | |
| 5 | 25 | 0.1% | |
| 6 | 11 | 0.1% | |
| 11 | 7 | < 0.1% | |
| 8 | 6 | < 0.1% | |
| 7 | 3 | < 0.1% | |
| Other values (3) | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1713 | 7.9% | |
| 1 | 5583 | 25.7% | |
| 2 | 11413 | 52.6% | |
| 3 | 2750 | 12.7% | |
| 4 | 200 | 0.9% |
| Value | Count | Frequency (%) | |
| 21 | 1 | < 0.1% | |
| 11 | 7 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 8 | 6 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.299926308 |
|---|---|
| Minimum | 0 |
| Maximum | 52 |
| Zeros | 1994 |
| Zeros (%) | 9.2% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7517924134 |
|---|---|
| Coefficient of variation (CV) | 0.57833464 |
| Kurtosis | 974.5832978 |
| Mean | 1.299926308 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.07399576 |
| Sum | 28224 |
| Variance | 0.5651918328 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) | |
| 1 | 11563 | 53.3% | |
| 2 | 7936 | 36.5% | |
| 0 | 1994 | 9.2% | |
| 3 | 179 | 0.8% | |
| 4 | 26 | 0.1% | |
| 6 | 7 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 21 | 1 | < 0.1% | |
| 52 | 1 | < 0.1% | |
| (Missing) | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1994 | 9.2% | |
| 1 | 11563 | 53.3% | |
| 2 | 7936 | 36.5% | |
| 3 | 179 | 0.8% | |
| 4 | 26 | 0.1% |
| Value | Count | Frequency (%) | |
| 52 | 1 | < 0.1% | |
| 21 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 6 | 7 | < 0.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2450 |
| Missing (%) | 11.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09203696013 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 17999 |
| Zeros (%) | 82.9% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4032146734 |
|---|---|
| Coefficient of variation (CV) | 4.381008161 |
| Kurtosis | 45.43580949 |
| Mean | 0.09203696013 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.953485465 |
| Sum | 1773 |
| Variance | 0.1625820729 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) | |
| 0 | 17999 | 82.9% | |
| 1 | 943 | 4.3% | |
| 2 | 184 | 0.8% | |
| 3 | 105 | 0.5% | |
| 4 | 22 | 0.1% | |
| 5 | 7 | < 0.1% | |
| 6 | 4 | < 0.1% | |
| (Missing) | 2450 | 11.3% |
| Value | Count | Frequency (%) | |
| 0 | 17999 | 82.9% | |
| 1 | 943 | 4.3% | |
| 2 | 184 | 0.8% | |
| 3 | 105 | 0.5% | |
| 4 | 22 | 0.1% |
| Value | Count | Frequency (%) | |
| 6 | 4 | < 0.1% | |
| 5 | 7 | < 0.1% | |
| 4 | 22 | 0.1% | |
| 3 | 105 | 0.5% | |
| 2 | 184 | 0.8% |
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 3594 |
| Missing (%) | 16.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 232.4990618 |
|---|---|
| Minimum | 0 |
| Maximum | 199636 |
| Zeros | 16114 |
| Zeros (%) | 74.2% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2016 |
| Maximum | 199636 |
| Range | 199636 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1609.666125 |
|---|---|
| Coefficient of variation (CV) | 6.923323099 |
| Kurtosis | 12998.78477 |
| Mean | 232.4990618 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 105.0771713 |
| Sum | 4212883 |
| Variance | 2591025.034 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 16114 | 74.2% | |
| 2021 | 408 | 1.9% | |
| 2020 | 278 | 1.3% | |
| 2010 | 179 | 0.8% | |
| 2005 | 101 | 0.5% | |
| 2019 | 91 | 0.4% | |
| 2008 | 72 | 0.3% | |
| 2000 | 62 | 0.3% | |
| 2012 | 61 | 0.3% | |
| 2015 | 59 | 0.3% | |
| Other values (52) | 695 | 3.2% | |
| (Missing) | 3594 | 16.6% |
| Value | Count | Frequency (%) | |
| 0 | 16114 | 74.2% | |
| 1 | 3 | < 0.1% | |
| 2 | 3 | < 0.1% | |
| 70 | 1 | < 0.1% | |
| 85 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 199636 | 1 | < 0.1% | |
| 2024 | 1 | < 0.1% | |
| 2023 | 2 | < 0.1% | |
| 2022 | 38 | 0.2% | |
| 2021 | 408 | 1.9% |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3595 |
| Missing (%) | 16.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.451570175 |
|---|---|
| Minimum | 0 |
| Maximum | 2021 |
| Zeros | 18079 |
| Zeros (%) | 83.3% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2021 |
| Range | 2021 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 94.64193047 |
|---|---|
| Coefficient of variation (CV) | 21.26034787 |
| Kurtosis | 448.1109438 |
| Mean | 4.451570175 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.21459507 |
| Sum | 80658 |
| Variance | 8957.095002 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=13)
| Value | Count | Frequency (%) | |
| 0 | 18079 | 83.3% | |
| 2019 | 11 | 0.1% | |
| 2018 | 6 | < 0.1% | |
| 2021 | 5 | < 0.1% | |
| 2017 | 4 | < 0.1% | |
| 2015 | 4 | < 0.1% | |
| 2010 | 3 | < 0.1% | |
| 2020 | 2 | < 0.1% | |
| 2016 | 1 | < 0.1% | |
| 2009 | 1 | < 0.1% | |
| Other values (3) | 3 | < 0.1% | |
| (Missing) | 3595 | 16.6% |
| Value | Count | Frequency (%) | |
| 0 | 18079 | 83.3% | |
| 2000 | 1 | < 0.1% | |
| 2005 | 1 | < 0.1% | |
| 2008 | 1 | < 0.1% | |
| 2009 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2021 | 5 | < 0.1% | |
| 2020 | 2 | < 0.1% | |
| 2019 | 11 | 0.1% | |
| 2018 | 6 | < 0.1% | |
| 2017 | 4 | < 0.1% |
| Distinct | 260 |
|---|---|
| Distinct (%) | 24.6% |
| Missing | 20656 |
| Missing (%) | 95.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77390.3724 |
|---|---|
| Minimum | 150 |
| Maximum | 888000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 169.6 KiB |
Quantile statistics
| Minimum | 150 |
|---|---|
| 5-th percentile | 34000 |
| Q1 | 51000 |
| median | 65000 |
| Q3 | 87000 |
| 95-th percentile | 145750 |
| Maximum | 888000 |
| Range | 887850 |
| Interquartile range (IQR) | 36000 |
Descriptive statistics
| Standard deviation | 59664.72297 |
|---|---|
| Coefficient of variation (CV) | 0.7709579515 |
| Kurtosis | 70.73878212 |
| Mean | 77390.3724 |
| Median Absolute Deviation (MAD) | 17000 |
| Skewness | 6.612967691 |
| Sum | 81879014 |
| Variance | 3559879167 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 60000 | 28 | 0.1% | |
| 65000 | 27 | 0.1% | |
| 70000 | 26 | 0.1% | |
| 45000 | 23 | 0.1% | |
| 53000 | 21 | 0.1% | |
| 67000 | 20 | 0.1% | |
| 80000 | 18 | 0.1% | |
| 55000 | 18 | 0.1% | |
| 40000 | 18 | 0.1% | |
| 56000 | 17 | 0.1% | |
| Other values (250) | 842 | 3.9% | |
| (Missing) | 20656 | 95.1% |
| Value | Count | Frequency (%) | |
| 150 | 1 | < 0.1% | |
| 230 | 1 | < 0.1% | |
| 240 | 2 | < 0.1% | |
| 250 | 1 | < 0.1% | |
| 280 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 888000 | 1 | < 0.1% | |
| 870989 | 1 | < 0.1% | |
| 530000 | 1 | < 0.1% | |
| 470000 | 2 | < 0.1% | |
| 435000 | 1 | < 0.1% |
agency
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 169.6 KiB |
| century | |
|---|---|
| futurehome | |
| elite | |
| mei |
| Value | Count | Frequency (%) | |
| century | 10000 | 46.1% | |
| futurehome | 9023 | 41.6% | |
| elite | 1392 | 6.4% | |
| mei | 1299 | 6.0% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.879110251 |
| Min length | 3 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | propertiesid | property_type | property_status | availability | country | division | city | current_zones | zone | century_zone | price | interior_area | gros_area | bedrooms | bathrooms | other_rooms | year_of_construction | year_of_renovation | closed_price | agency | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 948301 | Apartment | Used | Available | Albania | Tirana | Tirana | Ish Ekspozita | Ish Ekspozita | Ish Ekspozita | 67525.0 | 42.0 | 42 | 1 | 1.0 | 0.0 | 1975.0 | 0.0 | NaN | century |
| 1 | 1 | 948275 | Apartment | Used | Available | Albania | Tirana | Tirana | Oxhaku | Oxhaku | Oxhaku | 59000.0 | 84.0 | 84 | 2 | 1.0 | 0.0 | 1985.0 | 0.0 | NaN | century |
| 2 | 2 | 948219 | Apartment | Used | Available | Albania | Tirana | Tirana | Zogu I Zi | Zogu I Zi | Zogu I Zi | 81500.0 | 90.0 | 90 | 2 | 2.0 | 0.0 | 2019.0 | 0.0 | NaN | century |
| 3 | 3 | 947861 | Apartment | Used | Available | Albania | Tirana | Tirana | Astiri | Astiri | Astiri | 60000.0 | 68.0 | 79 | 1 | 1.0 | 0.0 | 2015.0 | 0.0 | NaN | century |
| 4 | 4 | 947596 | Apartment | Used | Available | Albania | Tirana | Tirana | Institut Kamëz | Institut Kamëz | Institut Kamëz | 73450.0 | 96.0 | 114 | 2 | 1.0 | 0.0 | 2015.0 | 0.0 | NaN | century |
| 5 | 5 | 947569 | Apartment | New | Available | Albania | Tirana | Tirana | Rruga e Elbasanit | Rruga e Elbasanit | Rruga e Elbasanit | 235000.0 | 204.0 | 215 | 3 | 2.0 | 0.0 | 2016.0 | 0.0 | NaN | century |
| 6 | 6 | 947536 | Apartment | Used | Available | Albania | Tirana | Tirana | Institut Kamëz | Institut Kamëz | Institut Kamëz | 44200.0 | 60.0 | 69 | 1 | 1.0 | 0.0 | 2015.0 | 0.0 | NaN | century |
| 7 | 7 | 947462 | Apartment | New | Available | Albania | Tirana | Tirana | Hipoteka | Hipoteka | Hipoteka | 179000.0 | 118.0 | 128 | 2 | 2.0 | 0.0 | 2018.0 | 0.0 | NaN | century |
| 8 | 8 | 946773 | Apartment | Used | In evaluation | Albania | Tirana | Tirana | Tregu Elektrik | Tregu Elektrik | Tregu Elektrik | 650000.0 | 404.0 | 486 | 3 | 3.0 | 0.0 | 0.0 | 0.0 | NaN | century |
| 9 | 9 | 946757 | Apartment | Used | Available | Albania | Tirana | Tirana | Stadiumi Dinamo | Stadiumi Dinamo | Stadiumi Dinamo | 176000.0 | 108.0 | 126 | 3 | 2.0 | 1.0 | 1993.0 | 0.0 | NaN | century |
Last rows
| df_index | propertiesid | property_type | property_status | availability | country | division | city | current_zones | zone | century_zone | price | interior_area | gros_area | bedrooms | bathrooms | other_rooms | year_of_construction | year_of_renovation | closed_price | agency | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 21704 | 21727 | 2660 | Apartment | New | Sold | Albania | Tirana | Tirana | Unaza e re | Unaza e re | Unaza e Re | 55000.0 | 96.0 | 110 | 3 | 2.0 | 2.0 | 0.0 | 0.0 | NaN | mei |
| 21705 | 21728 | 2659 | Apartment | Used | Withdrawn | Albania | Tirana | Tirana | Liqeni I Tiranes |##| Liqeni i Thate | Liqeni I Tiranes | Liqeni i Tiranës | 94000.0 | 99.0 | 99 | 3 | 1.0 | 2.0 | 0.0 | 0.0 | NaN | mei |
| 21706 | 21729 | 2656 | Apartment | New | Sold | Albania | Tirana | Tirana | Don Bosko | Don Bosko | Don Bosco | 98000.0 | 108.0 | 118 | 2 | 2.0 | 1.0 | 0.0 | 0.0 | NaN | mei |
| 21707 | 21730 | 2654 | Apartment | Used | Available | Albania | Tirana | Tirana | Ali Demi | Ali Demi | Ali Demi | 65000.0 | 120.0 | 0 | 4 | 0.0 | 3.0 | 0.0 | 0.0 | NaN | mei |
| 21708 | 21731 | 2652 | Apartment | Used | Sold | Albania | Tirana | Tirana | Don Bosko | Don Bosko | Don Bosco | 55000.0 | 72.0 | 80 | 3 | 1.0 | 2.0 | 0.0 | 0.0 | NaN | mei |
| 21709 | 21732 | 2651 | Apartment | Used | Sold | Albania | Tirana | Tirana | Blv. Zogu i Pare | Blv. Zogu i Pare | Blv. Zogu i Pare | 74000.0 | 74.0 | 80 | 3 | 2.0 | 2.0 | 0.0 | 0.0 | NaN | mei |
| 21710 | 21733 | 2650 | Apartment | Used | Sold | Albania | Tirana | Tirana | Don Bosko | Don Bosko | Don Bosco | 84000.0 | 103.0 | 111 | 3 | 2.0 | 2.0 | 0.0 | 0.0 | NaN | mei |
| 21711 | 21734 | 2649 | Apartment | Used | Sold | Albania | Tirana | Tirana | Tirana e Re | Tirana e Re | Tirana e Re | 198000.0 | 138.0 | 0 | 4 | 2.0 | 3.0 | 0.0 | 0.0 | NaN | mei |
| 21712 | 21735 | 2648 | Apartment | Used | Withdrawn | Albania | Tirana | Tirana | Tirana e Re | Tirana e Re | Tirana e Re | 95000.0 | 105.0 | 115 | 3 | 2.0 | 2.0 | 0.0 | 0.0 | NaN | mei |
| 21713 | 21736 | 2645 | Apartment | New | Sold | Albania | Tirana | Tirana | Kashar | Kashar | Kashar | 22000.0 | 48.0 | 0 | 1 | 1.0 | 0.0 | 0.0 | 0.0 | NaN | mei |